We present a measurement of the $t\bar{t}$ differential cross section with respect to the $t\bar{t}$ invariant mass,
$d\sigma/dM_{t\bar{t}}$, in $p\bar{p}$ collisions at $\sqrt{s} = 1.96$ TeV using an integrated luminosity of 2.7 fb$^{-1}$ collected by
the CDF II experiment. The $t\bar{t}$ invariant mass spectrum is sensitive to a variety of exotic particles
decaying into $t\bar{t}$ pairs. The result is consistent with the standard model expectation, as modeled by
PYTHIA with CTEQ5L parton distribution functions.
The top quark is the only known fermion with a mass near the electroweak symmetry breaking
(EWSB) scale. As such, it plays a special role in many beyond the standard model (BSM)
theories of EWSB. In the standard model (SM) the Higgs boson is responsible for EWSB and the
generation of the fermion masses, but it has not yet been observed. In models with top
condensation, such as technicolor and topcolor, the role of the SM Higgs boson is filled by a
composite particle that is a bound state of top quarks [?]. These models predict additional
heavy gauge bosons that couple strongly to top quarks. The hierarchy problem, also unresolved
in the SM, has recently been addressed by models with extra dimensions, such as the
Randall-Sundrum (RS) [?] and ADD models. In these models TeV-scale gravitons can decay, in
some cases preferentially, to $t\bar{t}$ pairs [1]. In all of these cases the production of
$t\bar{t}$ pairs at hadron colliders through BSM mechanisms distorts the $t\bar{t}$ invariant
mass spectrum relative to the SM expectation, as recently reviewed in [?].

In this Letter we report on the first measurement of the $t\bar{t}$ differential cross section
with respect to the $t\bar{t}$ invariant mass, $d\sigma/dM_{t\bar{t}}$. The analysis uses an
integrated luminosity of $2.70 \pm 0.16$ fb$^{-1}$ collected with the CDF II detector between
March 2002 and April 2008. Full details of the analysis presented here are given in [?].
Previous published studies have focused on searches for resonances in the $M_{t\bar{t}}$
spectrum [?], and placed a lower limit of 720 GeV/$c^2$ on the mass of a putative $Z'$ boson
decaying preferentially to $t\bar{t}$. In this Letter we take a different approach in which we
test the $M_{t\bar{t}}$ spectrum, generically, for consistency with the SM. In this way we are
potentially sensitive to broad enhancements of the spectrum and interference effects as well
as to narrow resonances.

The CDF II detector is described in detail elsewhere [10]. The components relevant to this
analysis include the silicon vertex detector (SVX), the central outer tracker (COT), the
central electromagnetic and hadronic calorimeters, the central muon detectors, and the
luminosity counters.

Because it allows reconstruction of the final state with good resolution, and because of the
good signal-to-background ratio, we use the "lepton+jets" decay mode of the $t\bar{t}$ pair in
this study. It consists of four energetic jets, two of which originate from bottom quarks and
two from the hadronic $W$-boson decay, a charged lepton with large transverse momentum
($P_T$), and a large transverse momentum imbalance ($\not\!\!E_T$) from the undetected neutrino
from the leptonic $W$-boson decay [?]. Extra jets may appear from initial- or final-state
radiation (ISR or FSR). A $t\bar{t}$ event may also be observed with fewer than four jets if a
jet is not reconstructed or is merged with another jet in the event. Monte Carlo (MC)
simulations of $t\bar{t}$ production are generated using the PYTHIA MC program [?] with the
CTEQ5L parton distribution functions (PDFs). The decays of heavy quarks ($b$ and $c$) are
modeled using EVTGEN [14]. The HERWIG MC program [?] is used for studies of the systematic
effects of the hadronization model. The $t\bar{t}$ MC samples are generated with a top quark
mass of 175 GeV/$c^2$. The results presented here are insensitive to changes of the generated
top quark mass of a few GeV/$c^2$.

Events from $p\bar{p}$ collisions are selected with an inclusive lepton trigger that requires
an electron (muon) candidate with $E_T > 18$ GeV ($P_T > 18$ GeV/$c$). From the triggered
events the signal sample is selected offline by requiring an isolated electron (muon) with
$E_T > 20$ GeV ($P_T > 20$ GeV/$c$). The isolation criterion requires $I < 0.1$, where $I$ is
defined as the calorimeter transverse energy in a cone of opening radius
$\Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} = 0.4$ around the lepton direction
(exclusive of the lepton energy), divided by the electron (muon) $E_T$ ($P_T$). We further
require $\not\!\!E_T > 20$ GeV and at least 4 jets, each with $E_T > 20$ GeV and
$|\eta| < 2.0$. Jets are identified using a fixed-cone algorithm with a cone size of
$\Delta R = 0.4$ and are constrained to originate at the $p\bar{p}$ collision vertex. Their
energies are corrected to account for detector response variations in $\eta$, calorimeter gain
instability, nonlinearity of calorimeter energy response, multiple interactions in an event,
and for energy loss in uninstrumented regions of the detector. For events with more than four
jets with $E_T > 20$ GeV, we use the four highest-$P_T$ jets in the $M_{t\bar{t}}$
reconstruction. Missing transverse energy is corrected to account for the shifts in jet
energies due to the jet energy corrections described above. $Z$-boson candidate events are
rejected by removing events containing a second isolated high-$P_T$ lepton. To suppress the
background from direct production of a $W$ boson and multiple jets ($W$+jets events) we
require that at least one jet in the event have an identified displaced secondary vertex,
consistent with the decay of a $b$ hadron. We label such jets, and the events that contain
them, as "$b$-tagged". The events selected prior to the $b$-tag requirement are called
"pretag" events. We observe 2069 pretag, and 650 $b$-tagged, events.
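The offline requirements above can be summarized in a short sketch. The event layout used here (dictionaries with keys such as `leptons`, `jets`, `met`, `btag`) is hypothetical and chosen only for illustration; it is not the CDF analysis software, and the calorimeter towers passed to `isolation` are assumed to exclude the lepton's own deposit.

```python
import math

def delta_r(eta1, phi1, eta2, phi2):
    """Cone distance sqrt(d_eta^2 + d_phi^2), with d_phi wrapped into [-pi, pi)."""
    d_eta = eta1 - eta2
    d_phi = (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(d_eta, d_phi)

def isolation(lepton, towers, cone=0.4):
    """Calorimeter E_T inside the cone around the lepton, divided by the lepton E_T (or P_T)."""
    cone_et = sum(
        t["et"] for t in towers
        if delta_r(lepton["eta"], lepton["phi"], t["eta"], t["phi"]) < cone
    )
    return cone_et / lepton["et"]

def passes_selection(event):
    """Lepton+jets selection with the thresholds quoted in the text (energies in GeV)."""
    # Isolated high-P_T leptons: E_T (electron) or P_T (muon) > 20 and I < 0.1.
    leptons = [l for l in event["leptons"] if l["et"] > 20.0 and l["iso"] < 0.1]
    jets = [j for j in event["jets"] if j["et"] > 20.0 and abs(j["eta"]) < 2.0]
    return (
        len(leptons) == 1                    # a second isolated lepton vetoes Z candidates
        and event["met"] > 20.0              # missing transverse energy
        and len(jets) >= 4
        and any(j["btag"] for j in jets)     # at least one secondary-vertex tag
    )
```

Dropping the final `btag` condition would correspond to the "pretag" sample described in the text.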

The $t\bar{t}$ signature described above can be mimicked by several processes, including
diboson ($WW$, $WZ$, $ZZ$), single-top, $Z$+jets, and $W$+jets production, as well as
processes without vector bosons to which we refer, generically, as "QCD" backgrounds. The
diboson and single-top quark yields are predicted using PYTHIA and MADEVENT [16] MC samples,
respectively, each normalized to the theoretical cross sections [17, 18]. The residual
$Z$+jets background is modeled using ALPGEN [?], with the parton showering and underlying
event model from PYTHIA. The QCD background typically has lower $\not\!\!E_T$ than events with
real $W$ bosons and is evaluated by fitting the $\not\!\!E_T$ distribution using templates for
QCD and $W$+jets sources and extrapolating the QCD fraction into the high-$\not\!\!E_T$ signal
region. ALPGEN is also used in the evaluation of the dominant background from $W$+jets
production. The $W$+jets background is determined separately for events with and without
heavy-flavor jets. For events with heavy-flavor jets we use the ALPGEN simulation to determine
the fraction, in each jet multiplicity bin, of $W$+jets events that are $Wb\bar{b}$,
$Wc\bar{c}$ or $Wc$. This fraction is then increased by a correction factor, determined by
comparing measured and ALPGEN-predicted heavy-flavor (HF) fractions in $W$+1 jet data. The
number of pretag $W$+heavy-flavor events is normalized to the total number of $W$+jets events
in each jet multiplicity bin of the data using the modified ALPGEN fractions. The background
contribution from these events is given by the pretag number of events times a MC-derived
tagging efficiency. Events without heavy-flavor jets can enter the signal sample if one of the
jets is mistakenly $b$-tagged. Such events are called "mistags", and they occur primarily due
to tracking errors, with a smaller contribution from interactions in the material of the
detector, and $K_S$ and $\Lambda$ decays. The background due to mistags in $W$+jets events is
evaluated using a measurement of the rate of mistags derived from multijet data. The mistag
rate is then applied to the number of pretag $W$+jets events, with no heavy-flavor jets, in
the data. This pretag number is calculated, in each jet multiplicity bin, from the total
number of candidate events corrected for the contributions from non-$W$+jets and from
$W$+heavy-flavor jets. The observed event yields and background predictions are given in
Tab. I, where the line labeled 'Other' includes dibosons, $Z$+jets and single top.

Process                   4 jets            ≥ 5 jets
W+HF                      58.0 ± 12.2       11.6 ± 2.9
Mistags                   18.9 ± 4.8        3.5 ± 1.6
QCD                       20.9 ± [?]        6.4 ± [?]
Other                     13.9 ± 0.8        3.1 ± 0.2
$t\bar{t}$ (6.7 pb)       358.6 ± 49.7      121.5 ± 16.8
Total Prediction          470 ± 57          146 ± 19
Observed                  494               156

TABLE I: Summary of sample composition [21].

The precision of the measurement of $M_{t\bar{t}}$ depends on the understanding of the jet
energy scale (JES). To reduce the uncertainty on the JES, we adopt an approach first used in
[22] and use the measured invariant mass of the hadronically-decaying $W$ boson to constrain
the JES. For events with two $b$-tagged jets, the two untagged jets are chosen as the jets
from the $W$ boson decay. For events with a single $b$-tagged jet, the pair with invariant
mass closest to the expected mean value from $W$ boson decays is chosen. There are 503
single-tagged and 147 double-tagged events in the sample. An unbinned maximum likelihood fit,
using MC templates for the dijet invariant mass distribution, for both signal and background,
returns the best-fit JES and its uncertainty. The fit value of the JES is subsequently used in
the analysis. The uncertainty returned by this procedure is approximately a factor of two
lower than the nominal uncertainty on the JES.
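The choice of the two jets assigned to the hadronic $W$ decay can be sketched as follows. This is an illustration under stated assumptions, not the analysis code: the jet layout (`p4`, `btag`) is hypothetical, the candidate pairs are assumed to be drawn from the untagged jets only, and `W_MASS = 80.4` is a stand-in for the "expected mean value from $W$ boson decays", which the text does not quote.

```python
import itertools
import math

W_MASS = 80.4  # GeV/c^2; assumed stand-in for the expected mean dijet mass

def invariant_mass(*vectors):
    """Invariant mass of the sum of (E, px, py, pz) four-vectors, in natural units."""
    e, px, py, pz = (sum(component) for component in zip(*vectors))
    m2 = e * e - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))  # guard against small negative values from rounding

def choose_w_jets(jets):
    """Return the indices of the two jets assigned to the hadronic W decay.

    jets: the four leading jets, each a dict with "p4" = (E, px, py, pz) and "btag".
    """
    untagged = [i for i, jet in enumerate(jets) if not jet["btag"]]
    if len(untagged) < 2:
        raise ValueError("need at least two untagged jets")
    if len(untagged) == 2:
        # Double-tagged event: the two untagged jets are taken as the W jets.
        return tuple(untagged)
    # Single-tagged event: take the untagged pair whose mass is closest to W_MASS.
    return min(
        itertools.combinations(untagged, 2),
        key=lambda pair: abs(
            invariant_mass(jets[pair[0]]["p4"], jets[pair[1]]["p4"]) - W_MASS
        ),
    )
```

The dijet masses selected this way are what a template fit for the JES would take as input; the fit itself is not sketched here.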

We reconstruct $M_{t\bar{t}}$, the $t\bar{t}$ invariant mass, using the four-vectors of the
$b$-tagged jet and the three remaining leading jets in the event, the lepton and the
transverse components of the neutrino momentum, given by $\not\!\!E_T$. We divide the
$M_{t\bar{t}}$ distribution into nine bins between 0 and 1400 GeV/$c^2$, with bin widths
ranging from 50 GeV/$c^2$ for bins for which a large number of events are expected to
600 GeV/$c^2$ for the highest bin. The resolution in $M_{t\bar{t}}$ is somewhat smaller than
the bin width, ranging from 11% near the peak to 15% at high mass. We subtract from the bulk
distribution the expected contribution from the backgrounds listed in Tab. I, which is modeled
using the Monte Carlo samples described above. The resulting signal distribution suffers from
resolution smearing and is corrected using a regularized unfolding technique, described below,
which also accounts for the longitudinal component of the neutrino momentum.
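A minimal sketch of the reconstructed mass and its binning, assuming four-vectors are given as `(E, px, py, pz)` tuples in GeV and the neutrino is treated as massless with zero longitudinal momentum at this stage. The bin edges are read off the bin ranges listed in Tab. II; the function names are illustrative.

```python
import bisect
import math

# Nine bins between 0 and 1400 GeV/c^2, with edges taken from the ranges in Tab. II.
BIN_EDGES = [0, 350, 400, 450, 500, 550, 600, 700, 800, 1400]

def m_ttbar(jets, lepton, met_x, met_y):
    """Reconstructed t-tbar mass from four jets, the lepton and the missing E_T.

    Only the transverse neutrino momentum is used (pz = 0); the unmeasured
    longitudinal component is left to the unfolding step.
    """
    neutrino = (math.hypot(met_x, met_y), met_x, met_y, 0.0)
    e, px, py, pz = (sum(component) for component in zip(*jets, lepton, neutrino))
    m2 = e * e - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))

def bin_index(mass):
    """Index (0-8) of the bin containing mass, or None outside 0-1400 GeV/c^2."""
    i = bisect.bisect_right(BIN_EDGES, mass) - 1
    return i if 0 <= i < len(BIN_EDGES) - 1 else None
```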

In order to extract the true underlying $M_{t\bar{t}}$ distribution from the
background-subtracted reconstructed distribution, we use the MC to create a response matrix
$A$, such that $Ax = d$, where $x$ is the true, binned distribution and $d$ is the measured,
binned distribution. Due to statistical fluctuations in bins with small numbers of events,
inverting the response matrix $A$ to solve for $x$ given $d$ often yields spurious results.
Instead we use singular-value decomposition (SVD) unfolding, as described in [23], where the
solution is regularized by populating the response matrix with event multiplicities instead of
probabilities. The application to this analysis is described in detail in [?].
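To illustrate why a regularized solution of $Ax = d$ helps, the sketch below uses a plain truncated SVD: components belonging to small singular values, which amplify statistical fluctuations, are dropped. This is a deliberately simplified stand-in and not the regularization scheme of [23]; the function name and the `n_keep` parameter are assumptions made for the example.

```python
import numpy as np

def unfold_truncated_svd(response, measured, n_keep):
    """Solve response @ x = measured using only the n_keep largest singular values."""
    a = np.asarray(response, dtype=float)
    d = np.asarray(measured, dtype=float)
    u, s, vt = np.linalg.svd(a, full_matrices=False)  # a = u @ diag(s) @ vt
    coeffs = (u.T @ d)[:n_keep] / s[:n_keep]          # expansion coefficients of x
    return vt[:n_keep].T @ coeffs

# Toy 3-bin response matrix with nearest-neighbour smearing (illustrative numbers only).
A = np.array([[0.8, 0.2, 0.0],
              [0.2, 0.6, 0.2],
              [0.0, 0.2, 0.8]])
x_true = np.array([100.0, 50.0, 10.0])
d = A @ x_true

x_full = unfold_truncated_svd(A, d, n_keep=3)  # equivalent to exact inversion
x_reg = unfold_truncated_svd(A, d, n_keep=2)   # smoother estimate, biased but more stable
```

With all singular values kept, the result reproduces the exact inverse; lowering `n_keep` trades bias for reduced sensitivity to fluctuations in `d`.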

From the unfolded $M_{t\bar{t}}$ distribution, we calculate the differential cross section
according to

$$\frac{d\sigma^i}{dM_{t\bar{t}}} = \frac{N_i}{\mathcal{A}_i \int \mathcal{L}\,dt \cdot \Delta^i_{M_{t\bar{t}}}},$$

where $N_i$ is the background-subtracted, unfolded number of events observed in each bin;
$\mathcal{A}_i$ is the acceptance in bin $i$; $\Delta^i_{M_{t\bar{t}}}$ is the width of bin
$i$; and $\int \mathcal{L}\,dt$ is the integrated luminosity. The acceptance is measured from
a mixture of data and MC. We use PYTHIA with a GEANT-based [13] detector simulation to measure
the geometric and kinematic acceptance. The lepton trigger and identification efficiencies are
measured in data using $Z \to \ell\ell$ decays. We account for the difference in efficiency
for identifying an isolated high-$P_T$ lepton in data and MC with a scale factor. Similarly we
use a scale factor to correct for the difference in efficiency in data and MC for tagging a
$b$-jet. The efficiency in data is determined in a heavy-flavor-enriched data sample of
low-$P_T$ electrons, from the semi-leptonic decay of $B$ hadrons.
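As a worked illustration of the formula, the sketch below evaluates the per-bin expression and, as a cross-check, integrates the central values of Tab. III over the bin widths. The function names are illustrative, and the first bin (listed as "< 350") is assumed to span 0-350 GeV/$c^2$ as in Tab. II.

```python
# (bin low edge, bin high edge, dσ/dM in fb per GeV/c^2): central values from Tab. III.
TABLE_III = [
    (0, 350, 0.47), (350, 400, 62.3), (400, 450, 33.8),
    (450, 500, 15.8), (500, 550, 9.9), (550, 600, 5.7),
    (600, 700, 2.3), (700, 800, 0.8), (800, 1400, 0.068),
]

def differential_cross_section(n_events, acceptance, lumi_fb, bin_width):
    """dσ/dM for one bin: N_i / (A_i * integrated luminosity * bin width)."""
    return n_events / (acceptance * lumi_fb * bin_width)

def integrated_cross_section_fb(table):
    """Sum of dσ/dM times bin width over all bins, in fb."""
    return sum(dsigma * (high - low) for low, high, dsigma in table)

total_pb = integrated_cross_section_fb(TABLE_III) / 1000.0
print(f"{total_pb:.1f} pb")  # prints "6.9 pb"
```

The sum reproduces the integrated cross section quoted in the last row of Tab. III.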

Our systematic uncertainties arise from MC modeling of the acceptance, true and reconstructed
$M_{t\bar{t}}$ distributions, and background distributions. In addition, the uncertainties on
the lepton identification efficiency, the $b$-tagging efficiency, and the integrated
luminosity affect the measurement. The lepton identification uncertainty arises due to the
extrapolation from $Z \to \ell\ell$ events, where the efficiency is measured in data, to the
higher multiplicity $t\bar{t}$ environment. The uncertainty on the $b$-tagging efficiency is
largely due to the limited number of events in the data sample that is used. These
uncertainties, together with a small uncertainty due to the finite size of the MC simulation
used to calculate the acceptance, comprise the acceptance uncertainty in Tab. II.

FIG. 1: $d\sigma/dM_{t\bar{t}}$ measured with 2.7 fb$^{-1}$ of integrated luminosity.

We consider several variations to the MC model of the signal and background. For the signal MC
simulation we compare the results using HERWIG to the default PYTHIA generator. The
uncertainty due to the limited knowledge of ISR is constrained by studies of radiation in
Drell-Yan events in the data. We vary both ISR and FSR within these limits and add the
deviations from the nominal value in quadrature. The uncertainty due to possible differences
in the PDFs from the nominal CTEQ5L PDF is evaluated by varying the PDF using the 20 CTEQ5L
eigenvectors, which represent 90% C.L. variations. The deviations from the nominal values are
added in quadrature with deviations measured using the MRST PDF [?] with two alternate choices
for the strong coupling constant. The uncertainty on the background prediction consists of two
pieces: the uncertainty on the background normalization, given in Tab. I, and a background
shape systematic for the MC modeling of the shapes. The systematic uncertainty due to the JES
includes a generic energy-correction systematic uncertainty as well as a contribution from the
modeling of the $b$-jet energy scale. The effects of each systematic uncertainty on the
measurement are evaluated using a pseudo-experiment approach. Pseudo-experiments are performed
for each variation described above and the difference between the mean
$d\sigma/dM_{t\bar{t}}$ in each bin with the shifted parameters and the default model is taken
as the systematic uncertainty in that bin. The results are presented in Tab. II. The dominant
systematic uncertainty is the uncertainty on the PDF set. This is expected as the tail of the
$M_{t\bar{t}}$ spectrum is very sensitive to the PDFs. The 6% uncertainty on the luminosity
measurement in each bin is not included in the total in Tab. II. Two effects cause the
uncertainty in the bins between 400 GeV/$c^2$ and 550 GeV/$c^2$ to be somewhat smaller than
outside of that range. One is the turn-on threshold of the $M_{t\bar{t}}$ spectrum, which is
insensitive to systematic variations because we fix the top quark mass at 175 GeV/$c^2$. The
second is the PDF uncertainty, which is much greater at large $M_{t\bar{t}}$ than at small
$M_{t\bar{t}}$.
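The "Total" row of Tab. II can be cross-checked with a few lines of arithmetic. The text does not state how the total is formed; the sketch below assumes it is the quadrature sum of the six listed sources in each bin, which the tabulated numbers reproduce to within the rounding of the inputs.

```python
import math

# Per-bin systematic uncertainties in percent, copied from Tab. II (nine mass bins).
SYSTEMATICS = {
    "MC Gen.":     [0.7, 2.4, 5.3, 5.7, 4.6, 3.3, 1.4, 1.0, 1.0],
    "ISR/FSR":     [1.5, 1.3, 0.8, 0.2, 1.1, 2.1, 2.0, 2.2, 3.3],
    "JES":         [8.2, 6.3, 4.1, 3.1, 1.7, 2.3, 4.6, 7.5, 9.1],
    "Backgrounds": [10.3, 7.4, 2.4, 1.7, 3.0, 4.0, 4.5, 5.1, 5.4],
    "Acceptance":  [4.5, 4.4, 4.3, 4.4, 4.6, 4.6, 4.4, 4.0, 3.8],
    "PDF Set":     [7.7, 6.1, 3.0, 1.0, 4.8, 9.3, 14.0, 17.4, 18.8],
}
QUOTED_TOTAL = [16.0, 12.6, 8.9, 8.1, 8.9, 12.0, 16.1, 20.1, 22.2]

def quadrature_total(sources):
    """Per-bin quadrature sum of all sources (assumes the sources are uncorrelated)."""
    n_bins = len(next(iter(sources.values())))
    return [
        math.sqrt(sum(values[i] ** 2 for values in sources.values()))
        for i in range(n_bins)
    ]

computed = quadrature_total(SYSTEMATICS)
largest_gap = max(abs(c - q) for c, q in zip(computed, QUOTED_TOTAL))
```

In the first bin, for example, the sum gives about 15.99%, against the quoted 16.0%.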

The measured $d\sigma/dM_{t\bar{t}}$ is shown in Fig. 1 and tabulated in Tab. III.

$M_{t\bar{t}}$ [GeV/$c^2$]   0-350   350-400   400-450   450-500   500-550   550-600   600-700   700-800   800-1400
MC Gen.                        0.7       2.4       5.3       5.7       4.6       3.3       1.4       1.0        1.0
ISR/FSR                        1.5       1.3       0.8       0.2       1.1       2.1       2.0       2.2        3.3
JES                            8.2       6.3       4.1       3.1       1.7       2.3       4.6       7.5        9.1
Backgrounds                   10.3       7.4       2.4       1.7       3.0       4.0       4.5       5.1        5.4
Acceptance                     4.5       4.4       4.3       4.4       4.6       4.6       4.4       4.0        3.8
PDF Set                        7.7       6.1       3.0       1.0       4.8       9.3      14.0      17.4       18.8
Total                         16.0      12.6       8.9       8.1       8.9      12.0      16.1      20.1       22.2

TABLE II: Summary of systematic uncertainties (in %) in each bin. The 6% uncertainty on the
integrated luminosity is not included in the total.

$M_{t\bar{t}}$ [GeV/$c^2$]       $\mathcal{A}_i$     $d\sigma/dM_{t\bar{t}}$ [fb/GeV/$c^2$]
< 350                            0.016 ± 0.001       0.47 ± 0.07 ± 0.08 ± 0.03
350-400                          0.023 ± 0.001       62.3 ± 7.0 ± 7.9 ± 3.7
400-450                          0.026 ± 0.001       33.8 ± 4.0 ± 3.0 ± 2.0
450-500                          0.027 ± 0.001       15.8 ± 3.0 ± 1.3 ± 0.9
500-550                          0.029 ± 0.001       9.9 ± 2.0 ± 0.9 ± 0.6
550-600                          0.030 ± 0.001       5.7 ± 1.2 ± 0.7 ± 0.3
600-700                          0.030 ± 0.001       2.3 ± 0.6 ± 0.4 ± 0.1
700-800                          0.030 ± 0.001       0.8 ± 0.3 ± 0.2 ± 0.1
800-1400                         0.023 ± 0.001       0.068 ± 0.032 ± 0.015 ± 0.004
Integrated Cross Section [pb]                        6.9 ± 1.0 (stat.+JES)

TABLE III: The acceptance and measured differential cross section in each bin. The
uncertainties on the cross-section values are, respectively, statistical + JES, systematic and
luminosity.

We check consistency with the SM prediction using the Anderson-Darling (AD) statistic [?],
which places an emphasis on potential discrepancies in the tail of the $M_{t\bar{t}}$
distribution. The distribution of the AD statistic for this analysis is rapidly falling, with
small values corresponding to more likely results. Using MC simulations, we optimize the bin
range of the Anderson-Darling statistic for maximum sensitivity to new physics and a minimum
of false positives and find $M_{t\bar{t}} > 450$ GeV/$c^2$ to be the most sensitive region of
$M_{t\bar{t}}$. We perform pseudo-experiments using the SM MC distributions of $M_{t\bar{t}}$
with the sample composition given in Tab. I. We calculate a p-value by taking the fraction of
pseudo-experiments with a larger observed (i.e. less likely in the SM) Anderson-Darling
statistic than that in data. The observed p-value is 0.28. We conclude that there is no
evidence of non-SM physics in the $M_{t\bar{t}}$ distribution.
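The pseudo-experiment p-value described above can be sketched as follows. The binned form of the Anderson-Darling statistic used in the analysis is not given in this Letter, so the weighting in `binned_ad` is an assumption modeled on the standard AD weight $1/[F(1-F)]$; the pseudo-experiments here draw a fixed number of events from the expected bin probabilities, and the restriction to $M_{t\bar{t}} > 450$ GeV/$c^2$ is not applied.

```python
import random

def binned_ad(counts, probs):
    """Binned Anderson-Darling-like distance between observed counts and expected probabilities.

    Assumed form: n * sum_k p_k * (F_obs - F_exp)^2 / (F_exp * (1 - F_exp)), where F is the
    cumulative fraction up to bin k. The last bin is skipped because F = 1 there.
    All entries of probs must be positive and sum to 1.
    """
    n = sum(counts)
    stat, f_obs, f_exp = 0.0, 0.0, 0.0
    for count, p in zip(counts[:-1], probs[:-1]):
        f_obs += count / n
        f_exp += p
        stat += n * p * (f_obs - f_exp) ** 2 / (f_exp * (1.0 - f_exp))
    return stat

def p_value(observed_counts, probs, n_pseudo=2000, seed=1):
    """Fraction of pseudo-experiments with a larger statistic than the observed one."""
    rng = random.Random(seed)
    n_events = sum(observed_counts)
    observed_stat = binned_ad(observed_counts, probs)
    n_larger = 0
    for _ in range(n_pseudo):
        # One pseudo-experiment: n_events draws from the expected bin probabilities.
        counts = [0] * len(probs)
        for b in rng.choices(range(len(probs)), weights=probs, k=n_events):
            counts[b] += 1
        if binned_ad(counts, probs) > observed_stat:
            n_larger += 1
    return n_larger / n_pseudo
```

A spectrum that matches the expectation gives a small statistic and hence a large p-value, while an excess in the tail pushes the statistic up and the p-value down.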